arXiv:1501.00562v3 [math.PR] 27Jun2016 


The Annals of Applied Probability 
2016, Vol. 26, No. 3, 1774-1806 
DOI: 10.1214/15-AAP1133 

(c) Institute of Mathematical Statistics, 2016 


ENTROPIC RICCI CURVATURE BOUNDS FOR DISCRETE 
INTERACTING SYSTEMS^ 

By Max Fathi and Jan Maas 

Universite Paris 6 and Institute of Science and Technology Austria 

We develop a new and systematic method for proving entropic 
Ricci curvature lower bounds for Markov chains on discrete sets. Us¬ 
ing different methods, such bounds have recently been obtained in 
several examples (e.g., 1-dimensional birth and death chains, product 
chains, Bernoulli-Laplace models, and random transposition models). 

However, a general method to obtain discrete Ricci bounds had been 
lacking. Our method covers all of the examples above. In addition, we 
obtain new Ricci curvature bounds for zero-range processes on the 
complete graph. The method is inspired by recent work of Caputo, 

Dai Pra and Posta on discrete functional inequalities. 

1. Introduction. Ricci curvature lower bounds for Riemannian manifolds 
play a crucial role in differential geometry, but also in probability theory and 
analysis. In particular, such bounds are known to imply important proper¬ 
ties of Brownian motion on the manifold. For example, positive curvature 
bounds imply several useful functional inequalities (logarithmic Sobolev in¬ 
equality, Poincare inequality), as well as exponential rates of convergence 
to equilibrium. Such results were first obtained using the celebrated Bakry- 
Emery approach developed in [1]. 

1.1. Discrete entropic Ricci curvature. In [18] and [9] a notion of Ricci 
curvature bounds for Markov chains was introduced, based on convexity 
properties of the relative entropy along geodesics in the space of probabil¬ 
ity measures over the state space, for a well-chosen metric structure. It is 
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the discrete analogue of the synthetic notion of Ricci curvature bounds in 
geodesic metric-measure spaces obtained in the contributions of Lott-Villani 
[17] and Sturm [24], 

The main observation behind the Lott-Villani-Sturm definition of Ricci 
curvature bounds is that, on a Riemannian manifold A4, the Ricci curvature 
is bounded from below by some constant k € M if and only if the Boltzmann- 
Shannon entropy ?{(p) = f plogpdvol is /t-convex along geodesics in the L^- 
Wasserstein space of probability measures on A4. However, this dehnition is 
not well adapted to discrete spaces, because the L^-Wasserstein space over 
a discrete space does not contain any geodesic. 

To circumvent this issue, [18] and [20] introduced a different metric W. 
This metric is built from a Markov kernel in such a way that the heat flow 
associated to this kernel is the gradient flow of the entropy with respect to 
the distance W. This is an analogue of the continuous situation, where it has 
been proven by Jordan, Kinderlehrer and Otto in [15] that the heat flow on 
is the gradient flow of the entropy with respect to the Wasserstein metric. 
A Markov chain is then said to have Ricci curvature bounded from below 
by some constant k if the relative entropy (with respect to the invariant 
probability measure of the chain) is K-convex along geodesics for the metric 
W. 

It has been shown in [9] that a Ricci curvature lower bound has signif¬ 
icant consequences for the associated Markov chain, such as a (modified) 
logarithmic Sobolev inequality, a Talagrand transportation inequality and a 
Poincare inequality. Therefore, it is desirable to obtain sharp Ricci curvature 
bounds in concrete discrete examples. To this day, there are very few results 
of this type. 

Mielke [20] obtained Ricci curvature bounds for one-dimensional birth and 
death processes, and applied these bounds to discretizations of 1-dimensional 
Fokker-Planck equations. Erbar and Maas [9] proved a tensorization princi¬ 
ple: the Ricci curvature lower bound for a product system is the minimum 
of the Ricci curvature bounds of the components. This property is crucial 
for applications to high-dimensional problems. As a special case, the result 
yields the sharp Ricci bound for the discrete hypercube {0,1}"'. The first 
high-dimensional results beyond the product-setting have been recently ob¬ 
tained by Erbar, Maas and Tetali [11], who obtained Ricci curvature bounds 
for the Bernoulli-Laplace model and for the random transposition model on 
the symmetric group. 

In spite of this progress, a systematic method for obtaining discrete Ricci 
curvature bounds has been lacking. In this paper, we present such a method, 
which allows us to obtain curvature bounds for several interacting particle 
systems on the complete graph. The method allows us to recover all of the 
above-mentioned results, and yields new curvature bounds for zero-range 
processes on the complete graph. 


RICCI BOUNDS FOR DISCRETE SYSTEMS 


3 


1.2. The Bochner-approach. To explain our method, let us recall that 
the discrete Ricci curvature bounds considered in this paper correspond to 
bounds on the second derivative of the entropy along W-geodesics. In the 
continuous setting, on a Riemannian manifold Al, the second derivative of 
the entropy along a 2-Wasserstein geodesic {pt) is formally given by 

(1) VAV'i) + ^A(|p) dpt, 

where -0 : [0,1] x A1 ^ M satisfies the continuity equation 9fp + V ■ {pVijj) = 0 
and the Hamilton-Jacobi equation dtijj + ^|VV’P = 0. Note that the inte¬ 
grand above can be reformulated using Bochner’s formula, which yields 

(2) - VA^Pt) + iA(| VV^t|2) = + Ric(VV^, W). 

Therefore, it follows at least formally that a lower bound Ric > n implies 
K-convexity of the entropy along geodesics. 

In the discrete case, the second derivative of the entropy along geodesics 
is given by an expression B{p,'ip) which somewhat resembles the right-hand 
side of (1), but the dependence on p is more complicated. Unfortunately, 
there does not seem to be a suitable discrete analogue of Bochner’s formula. 

In this paper, we present an approach to get around this difficulty. We 
obtain a convenient lower bound for the second derivative of the entropy, 
which turns out to be very useful; see Theorem 3.5 below. This expression has 
the same features as the right-hand side of Bochner’s formula (2): one of the 
terms is nonnegative and contains only second-order (discrete) derivatives, 
while the other terms contain only first-order derivatives. The expression is 
remarkably flexible, since it allows us to derive sharp Ricci bounds in the 
rather diverse examples mentioned above. 

The work of Caputo, Dai Pra and Posta [5] which inspired this paper, 
fits very naturally into this framework. In order to obtain modified logarith¬ 
mic Sobolev inequalities, these authors followed the Bakry-Emery strategy, 
which consists of computing the second derivative of the entropy along the 
“heat equation” dtp = Cp, where £ is the generator of a reversible Markov 
chain. It can be checked that this second derivative can be expressed as 
2B{p,\ogp). Therefore, the bounds obtained in [5] and [8] are a special case 
of the bounds on B{p,'ip), which we obtain for arbitrary ■0. We refer to 
Section 2.3 for more details. 

1.3. Organization and notation. In Section 2, we collect preliminaries 
on discrete transport metrics, the associated Riemannian structure, and the 
notion of discrete entropic Ricci curvature. Section 3 contains the general 
criterion for Ricci curvature bounds. Various examples are studied in Sec¬ 
tion 4. The Appendix states a few useful properties of the logarithmic mean 
that are used in the proofs. 
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Throughout the paper, we use the probabilistic notation 
7r[F] = 7r[F(r/)] = ^ F{T])7r{r]). 

2. Transport metrics and discrete Ricci cnrvatnre. In this section, we 
briefly recall some basic facts on the discrete transportation metric W, which 
plays a crucial role in the paper. This metric has been introduced in [18] in 
the setting of finite Markov chains, and (independently) in [19] for reaction- 
diffusion systems. Closely related metrics have been considered in [7] in the 
setting of Fokker-Planck equations on graphs. Our discussion follows [18] 
with a slightly different normalisation convention (see also [10]). 

We work with a discrete, finite space X and a Markov generator C, acting 
on functions •0 : Tb ^ M by 

fjGX 

The transition rates Q{r],fj) are nonnegative for all distinct rj^f] ^ X, and 
we use the convention that = 0 for all rj X. We assume that Q is 

irreducible, that is, for all rj,fj £X, there exists a sequence {r/i}(hQ C X such 
that T]o = r], r]n = t), and (5(?yi, ?yi+i) > 0 for all z = 0,..., n — 1. It is a basic 
result of Markov chain theory that there is a unique stationary probability 
measure tt on X, which means that 

^7r(7/) = l and 7r(f)) = ^ Q(r/, r))7r(r/). 
r,€X r}£X 

We shall always assume that vr is reversible for Q, that is, the detailed 
balance equations 

Q(r/,f))7r(r/) = Q(f), r?)7r(f)) 

hold for all r],ri £X. We shall refer to (T’,Q,7r) as a Markov triple. 


2.1. The nonlocal transport metric W. Let 



p \ X —^ M. 


^p(r/)7r(r?) = 1 
v(^x 


be the set of probability densities on X (with respect to the stationary mea¬ 
sure vr). We consider the nonlocal transport metric W defined for PQ,pi € 
3^{X) by 


y^{po,Pif :=inf 

p,P 


vMX J 
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where the infimum runs over all sufficiently smooth curves p : [0,1] —)■ 
and : [0,1] ^ satisfying the continuity equation 

(3) + = ^ Vr/Gd:’, 

fj&X 

with boundary conditions p\t=o = Po and p\t=i = Pi- It has been shown in 
[18] that W dehnes a distance on the set of probability densities 
Moreover, this distance is induced by a Riemannian structure on its interior 

:= {p G : p{x) > 0 for all x £ X}. 

This definition is a natural analogue of the Benamou-Brenier formulation 
of the Wasserstein metric, with one crucial additional feature in the discrete 
setting: the logarithmic mean 

ri g 

(4) p(p,fi):=9{p(p),p{fi)) with0(r, s)= / r^~^s^dp=- - - - 

Jo logr-logs 

is used to dehne, loosely speaking, the value of p along the edge between p 
and fj. Note that the integral representation of 9 holds for all r, s > 0, while 
the alternative expression requires r and s to be strictly positive. The main 
reason for the appearance of this particular mean is the fact that it allows 
one to formulate a discrete chain rule pS/\ogp = Vp, where 

:= V'(h) -V’(h) 

denotes the discrete gradient. This identity takes over the role of the usual 
chain rule Vlogp = Vp/p that conveniently holds in the continuous setting. 

In [18, 19], it is shown that the “heat equation” dtp = Cp is the gradient 
flow equation for the relative entropy 

np) := E 7r(??)p(p) logp(p), 

with respect to the Riemannian structure associated with W. In this sense, 
the metric W is a natural discrete analogue of the L^-Wasserstein distance. 
Moreover, at least in some situations [13], it can be shown that the metric 
W converges to the L^-Wasserstein in the continuous limit, in the sense of 
Gromov-Hausdorff. 

Following the Lott-Villani-Sturm approach, [9, 18] gave the following 
dehnition of lower Ricci curvature boundedness. 

Definition 2.1 (Ricci curvature lower boundedness). We say that a 
Markov triple {X,Q,7r) has Ricci curvature bounded from below by k G M 
if for any constant speed geodesic (pt )tG[o,i] in {^{X),W) we have 

'H{pt) < (1 - t)'H{po) + fH(pi) - |t(l - t)>V(po, pi)^ 
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We shall use the notation Ric(Af,Q,7r) > k [or briefly Ric((5) > k]. Since 
{^{X),yV) is a geodesic space, this definition is nontrivial. This notion of 
curvature has several interesting properties, obtained in [9]: 

• A Markov chain that has Ric((5) > «; > 0 satisfies various functional in¬ 
equalities, such as the modified logarithmic Sobolev inequality 

n{p) <-S{p,logp) ype^{X), 

a 

with constant a = 2 k, as well as a Poincare inequality 

YaTTr{'4>) < VV’:A’— 

A 

with constant A = k. Here, Vaiv(?/^) = 7r[ip‘^] — 7r[V']^, and £ denotes the 
discrete Dirichlet form given by 







^ rj 

These functional inequalities imply the exponential convergence estimates 
U{e^^p) < e-^^n{p) and \\e^^'ip\\L2(x,n) < e~^^U\\L^(x,n) 
along the heat equation dtp = Cp. 

• Ricci curvature bounds tensorize: if Ric(Ai, Qj, vTj) > Ki, then the product 
chain (Ti x A 2 , Q, tti (8) 7r2) dehned by 


{ Qi{xi,yi), 
Q2{x2,y2), 
0 , 


satisfies Ric(<Ti x ^ 2 , Q,tti (8) vr 2 ) >min{Ki,K 2 }. 


if X2 =y2, 
if xi =yi, 
otherwise, 


Remark 2.2. Besides the entropic Ricci curvature studied in this pa¬ 
per, several other notions of discrete Ricci curvature have been studied in 
the literature, including Ollivier’s coarse Ricci curvature [22], rough Ricci 
curvature by Bonciocat and Sturm [3] and Bakry-Emery-type conditions, for 
example, [16]. We refer to [23] to a survey covering some of these concepts. 

2.2. The Riemannian structure induced by W. It will be useful to de¬ 
scribe the Riemannian structure associated to W in more detail. Let £ := 
{{x,y) T X X X : Q{x,y) > 0} be the set of edges in the graph induced by 
Q, and let G be the set of discrete gradients, that is, all functions T : M 

of the form T(ry, fj) = V'0(r/, fj) := — V’(^) for some function ■0 : T —>• M. 
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Note that, at each p € the tangent space of is naturally 

given by T := {cr: —)• ]R| T(77)7r(r7) = 0}. It can be proved (see [18], 

Section 3) that, for each pG the mapping 

ICp : V'ljj 1 -^ E {'ipip) - 'ijj{p))p{p,fi)Q{p,fj) 
n&x 

defines an bijection between Q and T. At each p G this map allows us 

to identify the tangent space with Q. In other words, the continuity equation 
(3) provides an identification between the “vertical tangent vector” ^p and 
the “horizontal tangent vector” V^. A Riemannian structure is then defined 
by the p-dependent scalar product (•, •)p : ^ x ^ M given by 

■fjex 

ior d>, 'I': £1 —>■ M. We shall frequently use the notation 

A{p,'ijj) := liVV'llp, 

where || • \\p denotes the norm induced by the scalar product 

The geodesic equations for this Riemannian structure are given by 

dspsiv) + = 

rj&X 

ds'lpsip) + - '^{Mv)-'>Ps{p)f{ps)i{p,P)Q{p,v)=0, 

mx 

where, by abuse of notation, pi denotes the partial derivative of the loga¬ 
rithmic mean defined in (4), that is, for z = 1,2, 

Pi{p,p) :=di9{p{p),p{fi)). 




These equations can be regarded as discrete analogues of the continuity 
equation and the Hamilton-Jacobi equation, respectively. The Hessian of 
the entropy can then be calculated using the expression 


(HessiR(p)Vz/>,Vz/>)^ 


ds^ 


Ent(ps), 

s=0 


where (ps) solves the geodesic equations with initial conditions p|s=o = Po 
and V'|<j=o = "00 • The following result provides an explicit formula for the 
Hessian which resembles its continuous counterpart (1); we refer to [9] for a 
proof. With a slight abuse of notation, we write 
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Proposition 2.3 (Identification of the Hessian). For any p € 
and i/;: —)• M, we have 

:= (HessiH(p)Vi/>,Vi/>)^ 

= \(^P' - {p ■ 'V'ljj, 

where 

tp{p,p) ■.= pi{p,p)Cp{p) + p2{p,p)Cp{p). 

Since the metric structure we consider is Riemannian in one ex¬ 

pects that K-geodesic convexity of H. is equivalent to 

(Hess?^(p)ViA,ViA)^>Kl|VV^||2 

for any function i/) and any probability density p G As the Rie¬ 

mannian structure is degenerate on the boundary of ^(A’), this is not an 
immediate result, but this issue has been solved in [9], Theorem 4.4. We 
thus have the following result. 

Proposition 2.4 (Characterization of Ricci boundedness). Let k gE.. 
A Markov triple (iY,Q,7r) satisfies Ric(A’,Q,7r) > k if and only if for all 
p G and all functions : iY we have 

(5) B{p,fi)>KA{p,fi). 


In this paper, we provide a systematic method to prove such an inequality 
in concrete situations. A very useful criterion will be given in Section 3. 


2.3. The Bakry-Emery approach to the logarithmic Sobolev inequality. 
Let us now compare our method with the Bakry-Emery approach to the 
(modihed) logarithmic Sobolev inequality (MLSI), which has been devel¬ 
oped in the discrete setting in [4] and [5]. In order to prove the MLSI Ti{p) < 
^£{p, log p), in the Bakry-Emery method one considers the behaviour of the 
entropy h{t) := TLipt) for solutions to the heat equation dtpt = Cpt- Since 
h'{f) = —£{ptfiogpt), the MLSI asserts that h{t) < —^h'{t). Rather than 
approaching this inequality directly, the idea is to investigate the second 
derivative of the entropy. Suppose that one could prove the “convex en¬ 
tropy decay inequality” h"{t) > —ah'{t). Since h{t),h'(t) —>■ 0 as t —>■ oo, this 
inequality implies the MLSI after integration. Taking into account that 


h''{t) = TT[CptClogpt] + TT 


i^Pt 


Pt 


the following lemma (taken from [5]) summarises this discussion. 
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Lemma 2.5. Let a > 0 and suppose that the convex entropy decay in¬ 
equality 


-nlCpClogp] + vr 



> aE{p, log p) 


holds for all p G Then the modified logarithmic Sobolev inequality 

T~L{p) < ^£{p,logp) holds as well. Moreover, (2.5) implies the exponential 
convergence bound 


£{e^^p, log e^^p) < e °‘^£{p, log p) 


for all p G 


The last assertion follows readily from Gronwall’s lemma and is actually 
equivalent to (2.5). A direct calculation shows that 


•^{pfiogp)=£{pfiogp), 

B{p,log p) = ]^-K[CpClogp] + \ 


{C-pf 

P 


hence, in view of Proposition 2.4, the following lemma follows immediately. 


Lemma 2.6. Let k G M and suppose that ^lc{X,Q,' k) > k. Then the 
convex entropy decay inequality (2.5) holds with a = 2K. 

The condition Ric(A’, Q, vr) > k is in principle strictly stronger than the 
convex entropy decay inequality (2.5): we need to check that the inequality 
B{p,'ip) > KA{p,ijj) holds for all functions if, and not just for if = logp. 
There seems to be no reason why if = log p should always be the extremal 
case in this inequality, but we are unaware of an explicit example which 
demonstrates this. 

In general, the constant we shall obtain here for the lower bound on 
Ricci curvature is worse (i.e., smaller) than the best known constant for 
the modified logarithmic Sobolev inequality. This was to be expected, since 
even in the continuous setting lower bounds on the Ricci curvature have no 
reason to yield the optimal constant for the logarithmic Sobolev inequality 
in general. 


3. The Bochner method. To explain and apply our method, it will be 
convenient to use the following point of view. Given an irreducible and re¬ 
versible Markov triple {X, Q, vr), we consider a set G of maps of X onto itself, 
that represents the set of allowed moves, and a function c: X x G —M+, 
that represents the jump rates. 
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Definition 3.1. The pair (G, c) is called a mapping representation of 
C if the following properties are satisfied: 

1. The generator C can be written as 

5eG 

where Vsipiil) •= ~ fpiv)- 

2. For any 6 G G, there exists a unique € G satisfying 6~^{Sr]) = r] for 
all r] with c{rj,S) > 0. 

3. For every F : Af x G —>■ M, we have 


( 6 ) 


TT 


'^F{r],6)c{r],5) 

SeG 


= TT 


^F{6rj,6 

'-<5gG 


- 1 ' 


c{r],6) 


Every irreducible reversible Markov chain has a mapping representation. 
It is always possible to explicitly build one, by considering the set of bi- 
jections : ff —)■ ff that exchanges r/ and fj, and leaves all other points 

unchanged (see [9]), but in general it is more natural to work with a differ¬ 
ent, smaller set G, as we shall see in the examples of Section 4. Property 3 
expresses the reversibility of the Markov chain. 

With this notation, we can rewrite the action functional A as 

(5eG ^ 

As discussed in the Introduction, a key element of the optimal transport 
approach to curvature in the continuous setting is the Bochner identity 
|A(|V'i/’|^) — (Vi/;, VAi/;) = + Ric(V'0, Vi/^). This identity allows one 

to reformulate 

B{p,^l;)-.= l iA(|ViM2)-(VV^,VAV^)dp, 

JM ^ 

the continuous analogue of B, in the equivalent form 

B{p,^Ij)= [ + mc{V'ilj,V'ilj)dp. 

JM 

Since the first term in this expression is nonnegative, lower bounds on the 
Ricci curvature immediately translate into lower bounds on B{p,'tp). 

In the discrete setting, as far as we know, there is no direct analogue of 
Bochner’s identity. To get around this issue (in the context of the Bakry- 
Emery approach for functional inequalities), it was suggested in [4] and [5] 
to introduce a function R that satisfies some properties of spatial invariance, 
in such a way that a one-sided Bochner inequality holds. 
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Assumption 3.2. There exists a function R: X x G x G ^ M+ such 
that 

(Al) i?(ry, 7 , 5) = S, 7 ) for all 77 G A and 7 ,5 G G; 

(A2) 7r[^^ 5 R(j], 7 , (5)7/;(77, 7 , d)] = R{t], 7 , S)i/j{'yr], 7 -^, 5)] for all bounded 

functions ip : X xGxG— 

(A3) 'yd'q = 5777 for all 77 G A and ^,6 €G with R{r], 7 , 6 ) > 0. 


Clearly, (Al) and (A2) imply that for all bounded functions ip, 


(7) 


vr 


'^R{r],'y,6)ip{r],'y,6) 

^ 7,^ 


= TT 


'^R{r],-f,S)ipi6r],'y,6 7 

^ 7,5 


We will show that a function R that satisfies these assumptions automati¬ 
cally satisfies the following identity. 


Lemma 3.3. Let y? and ip be two real-valued functions on X, and let 
a : X X X be a symmetric function. Then the following identity holds: 


vr 


^ R{r], 7 , S)a{r], drfjV-^ip{ri) 

^ 7,5 



^ R{r], 7 , 6)V-y[a{r], 6r])Vs^piv)]'^5'^'rHv) 

^ 7,5 


In the case a = 1, the right-hand side in this equation can be thought of 
as a discrete analogue of {D'^ f, D'^g) dp. We can therefore think of this 

identity as a sort of weighted discrete Bochner identity. 


Remark 3.4. This is a generalization of the identity used in [5], where 
the authors proved this identity in the case a = 1. In the applications, we 
have in mind, we will use this identity with a the logarithmic mean, that is 
a{g,Sr]) =p{g,Sr]). 

Proof of Lemma 3.3. We consider 77, 5 and 7 such that R { g ,' y , 6 ) > 0. 
Note that, by assumption (hi), 5'ji] = 'y6g. First, we write 

V7[a(77, (577)V5(/?(77)] V5V^7/>(77) 

= a(777, 6jg)S/sipi'yg)S/-.fip{6g) - ai^yg, 6gg)S/sip{'yg)S/.yip{g) 

- a{g,5g)Vs(p{g)V^ip{6g) + a{g,Sg)V5:p{g)V^ip{g). 

We will show that each of the four terms on the right-hand side of this 
equality, when multiplied by R { g ,' y , 6 ), summed over all 7 and h and aver¬ 
aged over TT, yields 

'^R{g,g,5)a{g,6g)V5(p{g)V-^ip{g) . 

^ 7,5 


TT 
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For the fourth term, there is nothing to prove. For the third term, we 
have 


— TT 




7,(5 


= TT 


^ R{i], 7 , S)a{r], 6r])Vs-i (p{5r])V^ijj{6T]) 


'y,S 


(7) 

= TT 


= TT 


^ ii( 7 , 7 , 6)a{Sr], r])Vs(p{Tq)V 

~ 7,(5 

^ R{rj, 7 , 5 )a( 7 , d'n)Vsip{v)'^^ip{r]) 

7,5 


The same argument works for the second term. Finally, for the first term, 
we have 


TT 


<^77?) V5(^(77) V.yV’(<^7/) 

'7,(5 


= —TT 


Sjr])S7sip{jr])\/^-iTl;{6-fr]) 

*" 7,(5 


(A2) 

= —TT 


^ i?(7,7,5)a(7, Sr])V5ip{v)'^-y'ijT{Sr]) 


^ 7,^ 


= TT 


^ R{r], 7 , (i)a( 7 , 6r])Vs(p{r])V^'il;{r]) 

7,(5 


where the last identity is the one we proved right before for the third term. 
This completes the proof. □ 

Following [5], our strategy is to decompose the local weight c{r],6)c{r],'y) 
as R{rj,S,'j) + F( 7 , <5, 7 ) for a suitable function F. Since the contribution of 
R to the curvature is nonnegative, we will only have to study lower bounds 
for a functional that depends on F. 

Theorem 3.5. Assume that there exists a function R satisfying As¬ 
sumption 3.2, and let 


Then we have 

( 8 ) 


r( 7 , 7 ,( 5 ) :=c{r],'y)c{r],6) - R{r],-f,6). 


B{p,i;)>iB, + B2 + Bs)ip,TP), 
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where 


^ 2 (p,V') 


:= TT 


7 , S)Kv, Sr])Vs'ilj{vW^'il^{v) 


75^ 


:= —TT 
2 


■ 4^ 


*" 7,(5 


Y1 7, ^)Kv, V5V'(^))' 


/or all p € and a// ^ : A” ^ R. 

As a consequence, if Bi7-B2 + B^> kA, then the Ricci curvature of the 
Markov chain is bounded from below by k. 


Inequality (8) can be viewed as a discrete replacement for Bochner’s in¬ 
equality (2); the quantity B{p,'if), which is a discrete analogue of the contin¬ 
uous expression —{V'ijj,VA'ip) + ^A(|V'0p)dp, is estimated in terms of 
B-i{p,'if), a nonnegative expression involving only second-order derivatives, 
and a number of terms involving first-order derivatives that may be seen as 
curvature terms. 


Remark 3.6. In most of our applications, we will use the trivial bound 
Rs > 0, so that we will only need to obtain a lower bound on the terms Ri, R 2 
involving F. 


Remark 3.7. This result can be used to recover the criterion of Propo¬ 
sition 5.4 in [9]. Indeed, under their assumptions, we can take R{p,d,'y) = 
c{p,6)c{p,'y) and, therefore, T = 0. The conclusions easily follow. This crite¬ 
rion has been applied in [9] to obtain the sharp Ricci bound for the discrete 
hypercube, as well as nonnegativity of the Ricci curvature for the discrete 
cycle Z/AZ for N >2. 


Proof of Theorem 3.5. The quantity B{p,'if) from Proposition 2.3 
can be written as 

B{p,'ll)) = -{pV'lf^VCfj).,, + i(;CpVV’, VV’) 7 r, 

where 


{pV'ip,VCTp)^ 



V^il){5p)c{5p,-i) - V^f;{p)c{p,-f))c{p,6)p{p,6r]) 
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and 


= ^TT 


(9) 


■ 7,5 


+ hiv, ^v)'^'yP{^v)c{Sv,l))c{v, 5 ) 
Since the Markov chain is reversible, we can use (6) to obtain 
^5V'(?/)V^'0(5?7 )c(5 ?7, 7 )c(r 7 , d)p{r], Sp) 


TT 


7,5 


= —TT 


^ Vsilj{r])V^il;{r])c{r], -f)c{r], d)p{r], Sp) 

■ 7,5 


so that 


-(/5VV’, =7r 


= TT 


^ Vsip{v)V^'ip{p)c{p, -f)c{p, 6)p{p, Sp) 
7,<5 


7,<5 


+ TT 


^ -R(r?, 7, 5)V5i}{p)V^il}{p)p{p, Sp) 

■ 


The first term on the right-hand side of this equality we leave unchanged, 
and for the second we use Lemma 3.3 with a = p to get 


- (/5VV', V£V’)7r = TT 


( 10 ) 


7, 5)V 'ii’{p)p{p, Sp) 

■ 7,5 

1 


+r 


R{p, 7, 5)V5V^i;ip)V^[p{p, 6p)V5^l^{p)] 


We now take a look at the second term in (9). Using the reversibility of 
the Markov chain and the fact that p 2 {Sp,p) = pi{p,6p), we obtain 


TT 


'^{V 5 ip{p))‘^p 2 {p, 6p)V^p{dp)c{6p, p)c{p, (5) 


^ 7,5 


= TT 


hiP,Sp)V jp{p)c{p,p)c{p,6) 

7,5 
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and, therefore, 

1 


= -TT 


= —TT 
2 




7,(5 


X] <^)(V 5 ^(r?))^/ 5 i( 7 ?, Sr])V^p{r]) 


7,<5 


1 


Y1 '5)(V5V’(T/))^/5i(r/, 6p)V^p{p) 

^ 7,(5 

= :ri+T2. 

Again, we leave the first term on the right-hand side of this last equation 
unchanged. For the second term, we use assumption (A2) to obtain 

1 


T 2 = —TT 
2 


'^R{p,-f,6){Vs'ilj{p)fMv,Sp)V^p{5p) 

*” 7;^ 


and thus 

1 


T2 = -7r 


'^R{p,-i,5){V5'il^{p)f{h{p,^p)^-ip{v)+h{p,^p)^iP{5p)) 

*" 7,(5 

From (i) and (ii) of Lemma A.l we deduce the inequality 

pi{p, 6p)Vjp{p) + p2{v, 6p)Vjp{6p) > V^p(?7, 6p). 

Therefore, we have 


i(TpVV’,VV’). 


( 11 ) 


1 

> —TT 
“ 2 


7,<5 


1 


'^R{p,'y,5){Vsijiv)fV^p{p,5p) . 

7,5 

To deduce our result from (10) and (11), all that is left is to show that 


S:=7r 


R{p, 1, 7((V5V’(f/))^V.yp(r7, 6p) 

7,(5 


(12) 


-h VsVy'il;{7])Vy[p{7j, dp)Vs7p{v)]) 


= TT 


^ ii(r7,7,7/5(?7, (i?7)( V^V5^/;(r7))^ 

7,(5 
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We observe that 

{Vs'ipir])f'^^p{r],6rj)+ VsV.y'ip{rj)V^[p{rj,6r])V5'^{v)] 

= Kiv, h'n){'^ s'*P{'n)f 

- p{r], dr])Vs^Piv) + pi'yv, 6'y7])Vsip{'y7l)'^sip{^l), 

for all p G X and ^,5^0 with dpp = p6p. In view of (A3), we may insert 
this identity in the expression for S, which yields 


S = TT 


^ R{p, 7 , ( 5 )/ 5 ( 7 ? 7 , 5pp){v 57p{p)f 

^ 7,5 


— TT 


^ R{p, 7 , 5)p{p, 6p)V 57p{pp)V 57p{p) 

^ 7,5 


+ TT 


^ R{p, 7 , 6)p{'pp, Spp)'^ 57p{pp)V 57p{p) 

^ 7,5 


It follows from (A2) that the second and third term in the latter formula 
cancel each other. Another application of (A2) shows that the first term 
equals the right-hand side of (12), which was the last element we needed to 
complete the proof. □ 


4. Examples. 


4.1. Birth and death processes. In this section, we consider the case of 
birth and death processes on N = {0,1,...}. These are Markov chains with 
generator 

C7p{n) = a{n){7p{n + 1) — 7p{n)) + b{n){7p{n — 1) — 'ip{n)), 

where a and b are nonnegative functions on N, such that 6(0) = 0. The set of 
allowed moves is G := {+, —}, where +{n) = n-|-1 and — (n) = (n — l)]l{„>o}- 
Following the notation of the previous section, we write ^±'ip{n) = 7p{n ± 
1) — 7jj{n). The generator can therefore be written as 

C7p{n) = a{n)V+7p{n) + b{n)'V-'ip{n), 

and, in accordance with our notation, we set c(n,-|-) = a{n) and c(n, —) = 
6(n). 

We assume that this Markov chain is irreducible, and that there exists a 
probability measure vr on N satisfying the detailed balance condition 


a(n)7r(n) = 6(n -|- l)7r(n -|- 1). 
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Clearly, the latter condition is satisfied if and only if 

a{n — 1)•••a(0) 
bin)---b{l) 

When applying our method to study curvature bounds for this Markov chain, 
we obtain the following result. 


E 

n=l 


Theorem 4.1. Let k G M+. Assume that the rate of birth a is nonin¬ 
creasing, and that the rate of death b is nondecreasing. Assume moreover 
that 


(13) 


\{a{n) — a{n + 1) + b{n + 1) — b{n)) 

+ ^0(o(n) — a(n + 1), 6(n + 1) — b{n)) > k 


for all n G N, where 

Q{a,P)= inf (9(s,t)("- + yy 
s,t>0 \S t J 

Under these assumptions, the birth and death process has Ricci curvature 
bounded from below by n. 


Remark 4.2. Strictly speaking, to include this example in our frame¬ 
work we need to assume that the transition rates o(n), 6(n) vanish whenever 
n is sufficiently large, so that the state spaces becomes finite. However, it is 
reasonable to expect that the result below and its proof remain valid without 
this assumption. A rigorous inclusion of the countable setting would require 
us to modify some technical arguments from [9, 18], which is beyond the 
scope of the present paper. 

Remark 4.3. The same criterion has been obtained by Mielke [20] using 
a different method based on diagonal dominance of the matrix representing 
the Hessian of the entropy. Mielke worked on a finite state space {0,1,..., A^} 
rather than on N, but this does not effect the argument. 


Proof of Theorem 4.1. First of all, using the reversibility condition, 
we can rewrite the action functional as 

M(p,'i/’) = ^7J'[o(n)(V+'i/’(^))^/5(^)^ + 1) + b{n){Vp{n,n - 1)] 

= 7r[a(n)(V+'!/’(n))^/5(n,n-F 1)]. 

Following [5], we define the function i? : N x {-|-, —}^ —>■ M by 

{ R{n, +, +) = a(n)a(n + 1), 

R{n,-,-) = b{n)b{n-l), 

R{n, +, -) = R{n, -, +) = a{n)b{n) 
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SO that r(n, +, +) = —a(n)V+a(n), r(n, —, —) = — 6 (n)V_ 6 (n), and r(n, +, —) 
r(n, —,+) = 0. It is not hard to check that Assumption 3.2 is satished. In 
this case, we have 




= TT 


^ 7,<5 


(14) 


= —7r[a(n)V+a(n)(V+V'(^)) /5(n,n + I) 


+ 6 (n)V_ 6 (n)(V_'i/’(n)) p{n,n — l)] 

= 7 r[a(n)(V+ 6 (n) — V+a(ra))(V+'i/’(n))^/5(n, n + 1)]. 
Moreover, using reversibility. 


B2{p,'ip) = 


*” 75 ^ 


= -^vr[o(n)V+o(n)(V+V’(?^))^/Oi(?^,?T' + l)V+/o(n) 
+ 6(re)V_6(n)(V_'i/’(n))^/3i(n,n — l)V_/ 9 (n)] 

= -^vr[o(n)V+o(n)(V+V’(?T^))^/Oi(?^,?T' + l)V+/o(n) 


+ a(n)V+ 6 (n)(V+V'(n))^p 2 (?T','ri + l)V+p(n)]. 

A direct computation shows that the partial derivatives 6i{s,t) = di6{s,t) 
satisfy the identities 

{t - s)0i{s,t) = - 9{s,t), {s-t)d 2 {s,t) = - d{s,t), 

s t 

for any s,t > 0. Therefore, recalling that p{n,n + 1) = 6{p{n), p{n + 1)), we 

have 


B 2 ip,i’) = -X[a(n)(V+ 6 (re) - V+a(n))(V+V’(n))7(n,n + 1)] 

(15) 


1 

H—vr 
2 


are 


V+ 6 (re) V+a(re) 


(V+T/>(re))p(n,re+ 1)" 


^p{n + l) p{n 
Using the dehnition of 0, we obtain the inequality 

V+ 6 (re) V+a(re) ^ 0(—V+a(re), V+ 6 (re)) 


p(re + I) p(re) /5(re,re + I) 

Summing (14) and (15), and substituting the latter inequality, it follows 
from the assumption (13) that 


(01 +52)(p,'i/’) > kA{p,iI}). 

The result follows by an application of Theorem 3.5. □ 
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4.2. Zero-range processes. We now consider interacting particle systems 
on the complete graph, whose sites are labeled 1,...,L, where L is the 
number of sites. Configurations of particles are given by collections of non¬ 
negative integers rji,... ,rjL, where rjx denotes the number of particles at site 
X. The whole configuration will be denoted hy rj £ S := N'^. 

Changes in the configuration are made by moving a particle (if any) from 
a site X to a different site y. Allowed moves are therefore given by maps of 
the form y t—where x^y are two different sites, and is given by 

{ r]z^='nz, z ^ {x,y} or r]x = 0, 

r]x^='nx-l, iir]x>0, 

rjy'^ =r]y-\-l, iir]x>0- 

The set of allowed moves is G := {xy,x 7 ^ y}, where xy denotes the map 
rj rj^y. We write Vxy to denote the corresponding discrete gradient, and 
note that {xy)~^ = {yx) in the sense of Definition 3.1. 

In this section, we consider zero-range processes on the complete graph 
with L vertices. In such processes, the jump rate from site x to site y de¬ 
pends only on x and on the number of particles present at x. The rates are 
thus given by a family of functions : N —)■ [0, 00) such that Ca;(0) = 0 and 
Cx{n) > 0 for all n > 0 and x £ {1,..., L}. Here, Cx{yx) is the rate at which 
a particle is moved from site x to a site y, with y chosen randomly, with 
uniform probability on {!,...,L}. The generator of the Markov chain we 
just described is 

^fiv) = j'^Cx{yx)'^xyf{v)- 


This dynamic conserves the total number of particles N := rjx- We define 
a probability measure tttv on configurations with N particles as 


-KNiy) := 


1 

Zn 


L rjx 

nn 


x=l k=l 


1 

Cx{ky 


with the usual convention that the latter product equals 1 whenever yx = 0. 
The constant is a normalization constant, which is indeed finite, since 
there are a finite number of configurations with N particles. It can then be 
checked that the Markov chain is reversible with respect to vrjv- In the sequel, 
we shall systematically omit the subscript N, which we consider fixed. 

When applying our method to study the curvature of this process, we 
obtain the following result. 


Theorem 4.4. Assume that there exists c > 0 and 5 £ [0,2c] such that 
(16) c < Cx{n-i-1) — Cx(n) < c-i-d for all x £ {1,... ,L} and n>0. 
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Then the Ricci curvature is bounded from below by ^ In particular, if 

6 < |c, then we obtain a positive lower bound on the Ricci curvature. 

Note that the assumption implies that the rates Cx are strictly increasing. 
Of particular interest is the fact that the bound does not depend on either 
L or N. 


Remark 4.5. Simply assuming that the rates are strictly increasing is 
not enough to ensure Ricci curvature is positive. In Section 4.2 of [5], a simple 
example of a zero-range process for which supc 3 ;(l) — inf Cj;(l) is large was 
shown to exhibit nonconvex decay of the entropy and, therefore, the Ricci 
curvature of the Markov kernel cannot be positive. So some assumption of 
the form 6 < Kc is necessary. On the other hand, we do not know if the 
assumption 6 < 2clb is optimal. Assuming a uniform positive lower bound 
on the increase of the rates is known to be necessary to obtain a lower 
bound on the curvature that is independent of the system size and number 
of particles. Indeed, it was shown in [21] that for the case of constant rates, 
the spectral gap is of order ■ 

Remark 4.6. For independent random walks on the complete graph 
(which corresponds to d = 0 and c = 1), we recover the size-independent 
lower bound on Ricci curvature that was obtained in Example 5.1 of [9]. 

The estimate on the modihed logarithmic Sobolev inequality that can be 
deduced from the curvature bound has a worse dependence on 5 than the 
one obtained in [5] [which would correspond to a constant (c — (5)/2]. The 
modified logarithmic Sobolev inequality for this model has also been studied 
in [6]. 


Proof of Theorem 4.4. First, we observe that 




Cx{r]x){Vxyil^{v)fp{v, 




Following [5], we define the function R as 

1 

R{ri, xy, uv) = < 


L2 

1 


Cx iVx ) Cu {rju )) 


for X ^ u, 


{ L2 


Cx{r]x)cx{Vx - 1), otherwise. 


with the understanding that Cx(—1) := 0, so that T{r],xy,uv) = 0 if x u, 
and T{r],xy,xv) = jjiCx{'nx){cx{'qx) - Cxirjx - !))• It follows that 


^2{p,'ip) = -vr 


X] 6r])V^p{r]) 

■ 7,5 
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(17) 


2L2 


vr 


^ ^ Cx{,'nx)i,(^x{j]x) ('xijlx 1 )) 


-x,y,v 


X Mv,v''^)'^xvp{v) 

Using the reversibility assumption, this quantity is also equal to 
1 


2L2 


TT 


Cxi'nx)icy{riy + 1) - Cy{r]y)){V^yip{r])f P 2 {'il,r]^y) 


'-x,y,v 


x { pitn - p { v ^ y )) 
Adding this identity to (17), we obtain 
B2{p,ip) 


4L2'' 


'y ^ Cx{px){^xy'^’iv)) 


x,y,v 


X {(Cxirix) - Cx{r]x - l))pi{'n,p"'y)VxvP{'n) 

+ {CyiPy + 1 ) - Cy{py))p2{p,p'^y){p{p^^) - Piv""^))) 

^TT ^ C2;(?7x)(Va;j;'0(7?))V(?f'') 

X ((Cx(??a;) - Cx{px “ l))/5l (r/, + {Cy{py + 1) - Cy{py))p2{7J,v'"y)) 


1 

4 Z 2 


TT 


Y CxiPx){'^xy'llj{v)f{iCx{Vx) - CxiPx “ l))pi(??, ??*^)p(r?) 




+ (CyiPy + 1 ) - Cy{py))p2{p,P^y)p{V^y)) 

= :h+l2. 

Taking into account that pi > 0, we use (16) and the assumption that c > 5/2 
to obtain the lower bound 


h > 772^ 

4L2 


y-^^v § 

- sU' 


^ + «(>?,If'*)) 

T,y,v 

E c-(^-)(V-V’(^))V(^"^)(pi(^,^"")+/52(r?,r7"^)) 


-x,y,v 
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=: h- 


This quantity is nonnegative, but we will need it later to compensate a 
term that appears when computing Using (16) once more and 

Lemma A.l(ii), we obtain 


^ 0 + 6 
a -ly' 


0 + 5 
' 4L ' 

0 + 6 


^ C:^{r]^){V^y^l;{r])f{pi{r],r]^y)p{r])+p2{r],rj^y)p{r]^y)) 

x,y,v 

^ C:,{p^){V^yTp{p)fp{p, p^y) 
x,y 

A{p,'iIj), 


so that 
(18) 

Next, we observe that 
Bi{p,'ilj) = TT 


B2{p, -0) > -^^A{p, -0) + Is- 


7,5 


1 

“l2^ 


Cx{Px){Cx{Px) -Cx{Px - l))Vxyi’ip)^xv'4’{p)p{P-,n^^) 


-x,y,v 


By another application of (16) and reversibility, 
1 


^l(P)V’) = 


^ ^ Ox{+x^{,Ox{j]x') Oxijjx 1 )) 


x,y,v 


X '^xy'lp{v){'^xy1p{v) 


> 2cA{p,'tp) + j^TT 


Cx{Px){Cx{Px) - Cx{Px - l))Vxy'tp{p) 


-x,y,v 


X Vy,i;{p-y)p{p,p-y) 


= : 2cA{p,^p) - j^TT 


Cx{Px){Cy{Py + 1) - Cy{py)) 


■x,v,y 


X Vxyi>{p)^ XV il;{p)p{p,p^y) 
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Averaging both expressions for applying (16), and using the in¬ 

equality —2\ab\ > —a? — 6^, we obtain 


(19) 


^l(/5,V’) 

> C^(/9,V') 

1 


+ 


2L2 


Cx{rix)[Cx{px) - Cxitlx - 1) - {CyiPy + 1) “ Cy{r]y))) 


■x,v,y 


X '^xyi’i'n)'^ XV 


> {c-2 )^(P>V’)-^vr 




■x,v,y 


Adding (18) and (19) and applying Theorem 3.5, we obtain 


( 20 ) 

where 


B{p, 'tjj)>(--S] A{p, Ip) + h, 




^ p{r],rf'‘) 


-x,y,v 


Since reversibility yields 

XV 


IT 

'-x,y,v 

we obtain after averaging, 

5 




= TT 


Y,CxiPx){^x.mfpiv^^,rj^^) 


x,y,v 


8L2^ 


cx{vx){'^xvip{p))‘^ipiv''y){piiv,v'' 


x,y,v 


+ P2iv,vn)-Kr],v^^)-Kv^^,v^^)) 


Applying Lemma A.2 with r = p(r/^^), s = p{p) and t = p(ry*^), we infer that 


- “815' 


X] ivipiri)fK’h’l'"') 


x,y,v 


= -^Ap,'^P)- 


In view of (20), we obtain the desired result. □ 
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4.3. Bernoulli-Laplace models. We consider the exclusion process on the 
complete graph with L sites and N particles, where 1 < < L. The moves 

are of the form r] i—>■ ? 7 *^, where 

(r]7 = r]z, ifz^{x,y}, 

< Vx^ = 0, if - Tjy) = 1, otherwise = r^x, 

\ r]y^ = 1 , if rjxil - Vy) = 1, otherwise rjy^ = rjy. 

Note that such moves conserve the total number of particles. The state space 
S consists of all configurations with at most one particle on each of the L 
sites, that is, S = {rj ^ {0, 1 }^ : The set of moves is given by 

G := {{x, y) G {1,..., L}^ : x ^ y}. To simplify notation, we will frequently 
write xy instead of {x,y). The transition rates q : S x G ^ M+ are given by 

q{ri,xy) = 

hence the generator L takes the form 

= >^xrixil - rjy)^xy'ipiri), 

xy^G 

where S/xy'>p{'q) = — 'tp{ri). We shall frequently use reversibility in the 

form 


(21) 

TT 

^ F{T],xy)q{T],xy) 

= TT 

^ F{rfy,yx)q{ri,xy) 



-xy^G 


-xy£G 


for arbitrary functions F : S x G —>■ M. We observe that 





L x,y 


Note that we can omit the factor rjx{^ — fly)-! since S/xyil^iv) = 0 whenever 
r]x{l — r]y) = 0. This remark will allow us to forget this factor in most of the 
expressions that we shall manipulate. 

We obtain the following result on the Ricci curvature for Bernoulli- 
Laplace models. 


Theorem 4.7. Let 1 < < L — 1, and assume that there exists c > 0 

and (5 G [0,2c] such that 

(22) c<Xx<c + 6 for all X € {1,..., L}. 

Then the Ricci curvature of the Bernoulli-Laplace model is bounded from 
below by ^ In particular, when 6 < |c, the Ricci curvature is positive. 

In the special case where <5 = 0, we recover the result obtained in [11] 
using a different method, with exactly the same constant. 
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Remark 4.8. If 5 < |c, our result yields a modified logarithmic Sobolev 
inequality with constant c — Once more, this bound is slightly weaker 
than the one obtained in [5] (which is c — 5 in our notation). Note however 
that both bounds coincide for the homogeneous model, in which 5 = 0. The 
modified logarithmic Sobolev inequality for the homogeneous model has also 
been studied in [2, 12, 14]. 


Proof of Theorem 4.7. As in [5], we take 
R{r],xy,uv) = l 

lo. 


for \{x,y,u,v}\ = A, 
otherwise. 


It follows from the definition that T{r], xy, uv) = 0 when |{a:, y,u,v}\ =4 and 
T{r],xy,uv) = -^Xx^uVxi^ — Vy)'nu{^ — Vv) otherwise. We also notice that 
= ^xy^^zx'il^ = 0 for any choice of x, y and z, and for any ip and 
'll). We then have 




2L2^ 


1 

“T 

2L2 

+ 

1 


'^><l{'^xy1p{v)fMV,V'"^Wxyp{r]) 


x,y 


^ KK{^xy'il^{r])fh{r],rf'^)^uyp{ri) 

\{x,y,u}\=3 

^l{^xy'4^{v)fpl{P,P''^)^xvP{v) 

\{x,y,v}\=3 


—: Ji + J 2 + J 3 . 


Using reversibility in the form of (21), we obtain, after averaging with the 
original expression. 




4L2'" 




XxXy{^xy'lljip)fhiV, V"'^)'^xypiv) 

x,y 

- >'yhip,p"'^)){p{v''^) - p{v)) 

,y 


x:,y 


= -—A{p,'ip), 


where the inequality is obtained using (24) and (22). 
















26 


M. FATHI AND J. MAAS 


Another application of reversibility and averaging yields 


2L2^ 


4L2'' 


^ X.Xy{V,ymfp2{v,V^^){p{rin-p{v^^)) 

|{x,?/,i;}|=3 

^ Xx{'^xy'lpip)f{^xh{V, P'^'^WxvPip) 

\{x,y,v}\=^ 


+ A,p2(r?,r?"^)(p(r?-)-p(r?"^))) 


4L2'' 


X] ^^(1 “ “ Pv)Xx{Vxyi^{p)f pip""'’) 

\{x,y,v}\=?, 


X (A^pi(r7,r7"^^) + 
1 


4L2 


TT 


X] ??x(l-%)(1-??«)Ax(V^yV’(??))" 

|{x,i/,i;}|=3 


X {Xxh{p,p'"^)p{p) + Xyh{p,p''^)p{p''^)) 

Using the inequality Xx, Xy > c> 6/2 and Lemma A.l(i), we infer that 

Xxpiiv,v''^)p{v) + XyP2{v,p'"^)p{p'"^) > cp{ipp^y). 

Applying these bounds, we arrive at 
<5 


Jz> 


8L2 


vr 


X] ^^(1 “ Py)i'^ - Pv)Xx{y xyl/{p)f pip'''’) 

\{x,V,v}\=‘i 


X ihip^p"^) + p2ip,p''^)) 

{L-N-l)ic + 5) 


2L 


Al(p,V’), 


where the [L — N — 1) factor appears because, if there is a particle at x and 
no particle at y, there are exactly [L — N — 1) possible sites v different from 
X and y where there are no particles. 

Similarly, we have 


2L2^ 


y~l PuXxXu{Vxy'lpip))‘^Plip,p''y)VuyPip) 

|{x,i/,u}|=3 
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4L2^ 


|{x,?;,u}|=3 


X iPiiv,v''^)'^uyp{v) + - Piv""^))) 




|{x,i/,u}|=3 

(iV-l)(c + 5) 


> - 


2L 

{N-l){c + 6) 
2L 


-A{p,'iIj) 

Aip^ip). 


We now turn to Bi. To improve readability, we shall often suppress the 
variable rj in our notation. We have 




^yipf p{r],ri^y) 


^,v 


1 

+ U’' 


^ ^ XxXu^xy'4’^uy'4’PiV^ V 

|{x,?/,u}|=3 




'■|{3;,?;,-y}|=3 

= : J 4 + J 5 + Jg. 


We have the immediate bound 


J 4 > 


Another application of (21) yields 
1 


■^6 — ^2 ^ 


^'IVxil - Py)il - Vv)'^xy1p{'llj{v'''’) - 

\{x,y,v}\=3 




1 


|{3;,y,i;}|=3 


L2’" 


+ 


^ ^ XxXyTjxi^ Py){^ Pv)'^xy'^^xv'^p{j]^P 

\{x,y,v}\=3 

L-N-1 


L2 


-TT 


Y^lPx{l-Vy){'^xy'(p?KV,V''^) 
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Averaging the latter expression with the defining formula for Jg, we obtain 
L-N-1 

Jq — 


2L2 


-vr 


- r^y){Vpi'q.ri^y) 


'-x,y 


^ ^ Vy^i^ xy'4^'^xv'4^pij]i'n 

\{x,y,v}\=3 


Writing 


■- i!2^ 


X] - '?s)(i - <h){'^xvi’f Kn, rf’) 

for brevity, we obtain using the inequality — 2|a6| > —a^ — 
c{L-N- 1) 


> 


L 




4L2 


TT 


'■|{3;,?;,-y}|=3 

{2c-5){L-N -1) 


— J 7 


2L 


--4(p,V') - -^7- 


Using reversibility and averaging, we can show that 


( 21 ) (5 

- ii?’ 


8L2 


TT 


^ XxPxi'^ - Py){l - Pv){'^xvi^f pip'"'" 

|{a:,y,i)}|=3 

x^p^{i-py){i-p^){v^^'il)f{p{p^^,p^y) + p{p,p^y)) 

■\{x,y,v}\=^ 


TT 


E A.ry.(l -r?,)(l -r),)(V.,V’)'(/5(^"^r)"^) +/5(^,r?"^)) 

Therefore, we have 

(c-25)(L-A^-l) 


da + dg > 


2L 


-Aip^tp) 


5 

H-^vr 

8L2 


^ ^ Aa,r)a;(l Pv)i^xy' 4 ^') 

|{x,y,i;}|=3 


> 


X {p{pn{h{p,p^^)+p 2 {p,p^^))-p{p^\p^^)-p{^^^^^ 

(2c-5(5)(L-iV-l) 


4L 


-A{p,ip), 
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where the last inequality has been obtained through an application of Lemma A. 2 . 
Similarly, we have 


7 1 

Js - ^ 2 ^ 


^ KKrjxil - rjy)^lu'^xy'ip{'>P{r]) - 'ip{r]^y))p{rj,r]^y) 
\{x,y,u}\=3 


1 


> —J 5 + 
and, therefore. 


^ ^ ^x^u'Hxi^ 'Hy^Vui^xy'4’) p(j]jV 
\{x,y,u}\=3 

2c{N-l) 


L 




^ c(lV-l) ,, 

J 5 >- - — -A{p,'iIj). 

When we sum up, we get 

' {L - N - l){2c - 56) {N-l){c-6) 4c-6 


B{P,'^) > 


(23) 


>' 2 - 


4L 

c 5(L — 1) — 3A^ 


+ 


2 L 


2L 


A{p,ilj) 


4L 


5 LA(p,V’)- 


At this point, what we get is a size-independent lower bound on the Ricci 
curvature of f — X' However, we can improve this bound using the follow¬ 
ing duality argument for the Bernoulli-Laplace process: if we consider the 
system with N particles, and then remove all particles while simultaneously 
adding particles at every empty site, we get a Bernoulli-Laplace model with 
L — N particles. Since this is just a change in labeling (empty and full sites 
play symmetric roles), the properties of the dynamic are invariant by this 
transform. Therefore, without any loss of generality, we can assume that 
N > L/2. Under this extra assumption, the bound (23) leads to a lower 
bound on the Ricci curvature of § — which is what we were seeking to 
prove. □ 


4.4. The random transposition model. We now consider the random trans¬ 
position model, which is a random walk on the group of permutations Sn 
with re > 2. If we denote by Tn the set of all transpositions in 5^, we can 
write the generator of the dynamics as 

■= f ^ .N X] '^rficr), 
re(re- 1) ^ 

' 7 't In 

where Vrf{cr) = /(t o ct) — f{a) and vr is the uniform measure on Sn, that 
is, vr(cr) = (re!)“^ for all a € Sn- We write r = (i,j) for the transposition that 
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swaps i and j. We also write {i,j, k) for the mapping that cyclically permutes 
i,j and k. To simplify notation, we will use the shorthand notation 

aijk-={i,j,k)oa, 

We obtain the following result. 

Theorem 4.9. For n>2, the Ricci curvature of the random transposi¬ 
tion model is bounded from below by ■ 

This bound was already obtained using a different method in [11]. It 
should be noted that the modified logarithmic Sobolev constant implied by 
this Ricci curvature bound is significantly worse (by a factor 1/n) than its 
known optimal behaviour. We do not know whether the Ricci curvature’s 
true behaviour should match the MLSI-constant. 

Proof of Theorem 4.9. It follows from the definition that the action 
functional can be written as 



Note that a factor ^ appears, since every transposition (i, j) is counted twice. 
We define R as 




otherwise. 


Using reversibility and the fact that {i,j) ^ = {i,j), it follows that 




2 ( _iN2 ^ (^b^(^))^P2(cr,crij)(/9(u*jfc) 

^ W -*-1 L | r„- 1.1 i_Q 



Averaging the latter two expressions, we obtain 
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X (/o(Tifc)/Oi(cr,cry) +p(c7ijfc)/52(cr,crij)) 

1 


re^(n — 1)2 


TT 


|{ij,fc}|=3 


X {p{(^)pi{cr,crij) + p{aij)p2{cr,aij)) 

1 


+ ■ 


rTT 


E(V.: iip{a)f{pi{a,aij) - p2{a,aij))Vijp{a) 


2n2(n — 1)2 

Using (i) and (ii) of Lemma A.l and (24), we infer that 


^2{p,4’) > 


n^(n — 1)^ 


TT 


^ {Vijip{a)fp{aik,(7ijk) 


2(n - 2) 
n(n — 1) 


A{p,ij). 


The term Bi {p, ijj) can be written as 


13i{p,iIj) = 


n2(n — 1)2 


+ 


TT 


'^{Vijilj{a)fp{a,ai 


i¥=j 


n2(n — 1)2 
4 


TT 


S/ij'ip{a)S/ik'ip{cr)p{a,aij) 

\{ij,k}\=3 


n{n — 1) 


+ 


A{p,'ip) + 


n‘^{n — 1)^ 


TT 


{Vijil;{a)fp{a,aij) 

|{i,j,fe}|=3 


re2(n — 1)2 


TT 


Y VijV'(<7)('0(crifc) - V’(cru))p(Tcru) 

|{i,Tfc}|=3 


Using reversibility and averaging, the latter term can be reformulated as 
4 


n2(n — 1)2 


TT 


E V, jilj{a){'ilj{aik) - ip{aij))p{a, aij) 

|{*,i,fc}|=3 


n2(re — 1)2 
2 

n2(re — 1)2 
X p{a,aij) 


TT 


TT 


Y VijV;(a)(V’(<T) -'4^{aijk))p{(J,aij) 

\{i,j,k}\=3 

Y VijV'(0-)(V'(0-ifc)-V’(Tij)+V’(T) -'0(o-ijfc)) 

|{i,j,fc}|=3 
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+ 


4(n-2) 
n(n — 1) 
2 


A{p,'4>) 


n^(n — 1 )^ 


TT 


^ Vijip{a){'il;{aik) -'ip{aijk))p{a,a, 
\{i,j,k}\=3 


ijj 


which yields 

Bi{p,'ip) = f-T—n + ^^ ]A{p,i)) 

\n{n — 1) n[n — Ij ' 

2 


+ 


vr 


'n?{n — 1)2 [ 


\{i,j,k}\=3 


Young’s inequality and reversibility imply that 


n n{n — 1) 

1 


n2(n — 1)2 


vr 


n — 1 
2 


-4(p,v/') - 
^(p,v/>) - 


W^ik) - i’{<yijk)fp{(y, cTij) 

|{*dA}|=3 

1 


n 2 (n — 1)2 

1 


vr 


rTT 


k<->-i n — 1' ^' n2(n—1)2 

When we sum Bi and B 2 , we get 

mp,V’)> 


{'^jk'ip{cr)fp{(^ik,(^ikj) 

\{i,j,k}\=3 

|{ij,fc}|=3 


n{n — 1) 


Aip^ip), 


which is the desired result. □ 


APPENDIX: PROPERTIES OE THE LOGARITHMIC MEAN 

In this paper, we make use of some basic properties of the logarithmic 
mean, given for a,b>0 (resp., a,6 > 0) by 


6{a,b) := ( ^bPdp - 

Jo 


a — b 


log a — log b 


The following properties are taken from [9]; see Lemma 5.4. We write 9i[s, t) = 
di6{s, t) for i = 1,2. 


Lemma A.l. The following assertions hold: 
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(i) u 6 i {s,t) + V 62 (s, t) > 6 {u, v) for all s,t,u,v > 0 ; 

(ii) s9i{s,t) + t 62 {s,t) = 6 {s,t) for alls,t>0; 

(iii) 6 is symmetric, concave and increasing in both variables; 

(iv) 6 is positively 1-homogeneous, that is, for all X,s,t>0, we have 
6 {Xs, At) = X9{s,t). 

It follows directly from this lemma that for every Ai, A 2 > 0 and s,t>0, 

(24) {Xi9i{s,t) - X 292 {s,t)){s -t) < (max{Ai, A 2 } - min{Ai, A2})0(s,t). 

The following inequality plays a crucial role in the proof of the curvature 
bound for the Bernoulli-Laplace model and for zero-range processes. 

Lemma A.2. For any r > 0 and s,t>0, we have 

r{9i{s,t) -\- 92 {s,t)) - {9{r,s) -\-9{r,t)) > -9{s,t). 

Proof. If r = 0, the inequality is trivially true, so without loss of gen¬ 
erality we can assume that r > 0. Let u = sjr and v = tfr. Using the fact 
that 9 is 1-homogeneous, and 9i and 02 are 0-homogeneous, the inequality 
we wish to prove is equivalent to 

(25) 9i{u, v) + 92{u,v) — 9(1,u) — 0(1, u) > —9{u, v). 

Since 0 is concave, we have 

9{l,u) + 9{l,v) = 2 X ^{9{u,l) + 9{l,v)) 

= 9{u -I- l,u -I-1). 

Using the “curve below tangent” formulation of concavity applied to the 
function x 9{u + x,v x), we have 

0(u-|- l,u -I- 1) < 9{u,v) + 9i{u,v) + 02(u,u), 

and (25) immediately follows, which completes the proof. □ 
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